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(54) Use of processor bus for the transmission of i/o traffic 

(57) The exctiange of commands and data between 
I/O devices, such as DASDs. and a computer system, 
prefenrably a muWprocessor computer system, usually 
takes place via I/O adapters. The question arises of how 
to couple these 1/0 adapters to the conputer system In 
prior art solutions, the I/O adapters were either attached 
to the secorxi level cache or to a memory bus. The 
present invention relates to a method of coupling the 
stream of I/O commands arvJ I/O data to the computer 
system via the processor txisses. Because of the high 
bandwidth of the processor busses, an additional trans- 
mission of I/O data does not disturb regular data traffic 
on the processor kxjs. One advantage of using the proc- 
essor busses for the transmission of I/O data is that pins 
of the second level cache ch^don*t have to t>e used for 
the attachment of I/O adapters any more and thus 
become available for other purposes. For example, the 
t>andwidths of processa txisses or memory kxxsses can 
be increased, or additional processor busses can be 
acconrxxiated. 

The invention is especially useful with regard to a 
cotpGngfadlity. In a computer system which consists of 
a muftrtude of coupled computer systems, a coipling 
fadlity is the central computer system respons&le tor 
the management of data flow between the attached 
conputer systene and for the management of a rather 
large central memory. The large amount of I/O and 
intersystem traffic arising in such a system can be han- 
dled by the invention proposed. 
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Description 

FIELD OF THE INVENmON 

The present invention relates to a method of cou- 
pling I/O traffic to a computer system, especially to a 
multq:>roces8or computer system. 

BACKGROUND OF THE INVENTION 

A typical multiprocessor computer system com- 
prises several processors, cache memories of different 
hierarchy levels, storage controlling devices, a main 
memory, and means that control data exchange with I/O 
devices. 

When descrit}ing the different memory levels the 
processors address, one has to start with the first level 
cache. Each processor has its own. individually 
assigned first level cache integrated on-chp. Ther^e. 
the wiring delay is small for read and write operations to 
this cache and the latency is low, which means that a 
very fast access to the data said instructions stored in 
the first level cache is possible 

The n&A level in tfie hierarchy of cache storage 
means is the second level cache. The task of this cache 
is to provide data exchange with t3e first level cache 
The storage size of the second level cache is much 
greater than that of the first level cache, and usuaDy, the 
second level cache is not integrated on the processor 
chip, txjt placed on a dedicated ch^ (or set of chips). 
Therefore, the latency for a second level cacfie access 
is not as low as for th^ first level cacha 

There exist several concepts for the realization of a 
secorxJ level cache structure in a multiprocessor com- 
puter system. 

One of these concepts is to assign an individual 
second level cache to each processor, which means 
that each processor can only address the data in its per- 
sonal cache. Therefore, each of these "dedicated 
caches" holds exactly the data units reguested by its 
processor; and of course, two of these caches usually 
hold cfifferent data units. 

L.et us suppose that processor 1 writes to a cache 
line contained in its second level cache, and tfiat a copy 
of said line ts also contained in a second level cache 
owned by a different processor, e. g. processor 3. In 
the moment the write access to cache 1 occurs, the 
page in cache 3 is not valid any mora Therefore, a 
mecfianism has to exist (the so^ed switchingnand- 
snoop'ng network) that s^nals to cache 3 that the data 
of a certain line has become invalid, in order to maintain 
data integrity. 

The necessity of keeping track of efferent versnns 
of one and the same cache line in tfw second level 
cache falls away when a shared cache structure is real- 
ized. "Shared" means that each processor addresses 
the same second level cache storage space The 
accessft3le lines that have been copied to and that are 



new contained in the second level cache are the same 
for all processors. 

This also implies that a shared second level cache 
built in order to have a performance similar to that of 

5 individually assigned second level caches has to have 
an absolute storage size that is several times the size of 
one of sakJ individual caches. When realizing said 
shared cache concept, one usually employs several 
second level cache chips in parallel. 

10 B^ween second level cache and processors, oata 
istransfened via so-called processor busses. As a cer- 
tain line that a processor wants to address can be 
stored on any of the second level cache dhips, each 
second level cache chip has to be connected to each of 

IS the processors of the nmiftiprocessor system via one of 
saki processor busses. In order to take advantage of a 
low latency of the second level cache, each processor 
bus has to be capable of high-speed data transmisskxi 
and provkie a high-bandwktth communk:atk>n link. Usu- 

20 ally, regular instructkMi and data f kiw on the processor 
bi^ from the second level cache to the processor, and 
vice versa, only requires about 20% of the maximum 
data transfer rate po6si)le on sakJ processor bus. 
Alt the devk^es described so far, the processors and 

25 the second level cache chips, may be mounted on one 
multi-chip modula This module exchanges data via 
menxMy txjsses with a main memory, consisting of at 
least one memory card. Each memory card comprises 
a storage controller, whrch also provides the ckx^king 

30 system for the whole ntemory card, and several mem- 
ory banks. 

When a processor requests data which is not 
accessible in the second level cache yet, lines have to 
t>e transferred from a menfK)ry bank of the main memory 

35 to the second level cacha On the other hand, cache 
lines that have been modified in the second level cache 
have to be rewritten to the main memory in order to 
ensure data integrity. 

Data exchanged between the computer system on 

40 the one hand and external devk^es, such as discs, 
tapes, etc. on the other hand is generally referred to as 
l/O-traffk:. Usually. I/O devices are coupled to an IAD 
adapter, whk;h is connected to the computer system via 
the second level cacha One or two of the processors of 

45 the multiprocessor system, the socalled servk^e assist 
processors (SAPs), are especially responsft)le for han- 
dling data exchange between the I/O devices and the 
main memory. 

In a rrailtiprocessor system, each of the second 

50 level cache chps has to acconrKxiate processor busses 
connecting sakJ second level cache chip to each of the 
processors, memory busses linking sakl cache chip to a 
menK>ry card, and additional data Gnks for exchanging 
data with I/O devces. 

55 All these connectkxis require a lot of the second 
level cache chip's pins. 

UsuaOy, a chp has a skie length of 14.5 or 16 mm. 
The lower skie of the chip bears an array of blobs of sol- 
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der. in order to connect the chp to the ceramic sub- 
strate, the mufthchip module. For a typical chip, the 
rujrTt>er of soldering tags is 1240. One would be 
tenpted to increase the number of pins k>y simply 
increasing the chp siza When soldering the chip to the 5 
substrate according to the ti^) chip technique*' 
(descrbed, tor example, in L F. Miller's article "Control- 
led Collapse ReflGw Ch^ Joining", IBM J. Res. 
Developi, May 1969. p. 239-250). und thus heating up 
both the chip and the sut)5tFate. shear strains arise w 
because of the cBfferent coefficients of expansion of cNp 
arxi sut)strate. The occurence of tfiese shear strains 
imposes physical limits to the increase in chp size and 
to the increase in pin count 

Therefore, pins are valuable, and any method tfiat is 
contributes to saving pins, especially pins of a second 
level cache chip, permits a better layout arxJ improved 
performanca 

One method for saving pins is the introduction of 
switchingnrteans. For example, both a nrieniory bus and 20 
a data link to an i/O adapter can be coupled to a switch- 
ing networkwhich. for its part, ^cormected to a second 
la^el cache chip. Thus, the memory bus arvJ the I/O 
data Bnk share the same cache chip pins, and tfierefore. 
a kit of the pins that formerly were necessary to acoo- 2s 
nxxiate both the memory bus and the I/O data link 
t>ecome availabla Of course, one does not have to 
switch between a memory bus and an I/O data link, one 
couM as well switch between Enks to different I/O 
devk^ or. in case a multiprocessor system with a dus- so 
tared shared second le^ cache is reaGzed. k>^ween 
connectkxis to (fifferent dusters. 

The introduction of swrtcfies has several disadvan- 
tages, though. For example, let US conskier the case 
that both an I/O data Onk and a memory bus are con- 35 
nected via a switch to a second level cache chp. When 
data traffk; occurs on both the I/O data link and the 
menfxxy bus, the switch can only permit access to the 
secorxJ level cache to one of them, wtrk^h means that 
the other one cani exchange data with the cache at that 40 
time and has to wait for several cydes. Furtheroa the 
implementatfon of additional switches rec^ires addi- 
tional chips. Increasing the nunrt>er of dtps always 
means that packaging becomes more diffcult and tfiat 
the wiring delay of certain signals irx^reasea The main 45 
disadvantage of the use of swrtdies is tfierefore the 
occurence of addtional delays wtuch severely degrade 
system performanca 

So tar, reference has only been made to a single 
multiprocessor system. In order to furtfier increase sys- so 
tem performance, concepts of paraDel computing have 
been realized where several of said muftiprocessor sys- 
tems are conneded. Each of these miitprocessor sys- 
tems stall addresses its incfivkkial memory, but there 
also exists a central shared memory wfuch every multi- ss 
processa system can address. The storage size of this 
central memory is much larger than the size of the 
merrxxies incGvkJually assigned to each rrultprocessor 



system. For each data structure (list, cache, lock struc- 
ture) contained in saki central menrxxy ownerships are 
defined which enable or (fisat)le access of a certain mul- 
tiprocessor system to saki data structure. One of the 
multiprocessor systems, the so-called coupling fadlity, 
is dedicated to managing tx3lh the communicatfon/^yn- 
dironizatfon between the cfifferent multiprocessor sys- 
tems and the access (and access rights) of the 
multiprocessor systems to the central memory whfoh is 
attached to sakJ coipling fadlrty. This implies that all 
data that one of the multprocessor systems fetches 
from or stores to ttie central memory has to be 
exchanged between saki multprocessor system arxi 
the cotpling fadlity. Thus, strong data links have to exist 
t>etween the systems and tfie coupling fadlity, arxi the 
t>andwkith required fw these Gnks in order to permit 
quk:k access of afl rruttiprocessor systems to the cen- 
tral memory is much higfier than the bandwkfth required 
for the transmission of conventfonal I/O traffk: between a 
coirputer system and its external devfces. Therefore, 
with respect to a coupfing fadfity. the question arises of 
how to connect such an addrtkxial intersystem commu- 
nfoatfon link to the computer system 

Again, one sdutfon is the introductkyi of switching 
means in order to couple saki intersystem conununk^a- 
tfon link to the second level cacha With this method, it 
is possS)le to acconrvnodate the link and stSI meet pin 
count requirenrmnts. Because of the additfonal wait 
cydes caused by switching means, the latency of an 
access to the computer system is increased. Thus, the 
introductfon of switching means might t>e a conrect con- 
cept for a coipling fad&ty of today, but for future parallel 
systems, latency wil I have to t>e much fower. 

OBJECT OF THE INVEhmON 

The objed of the present inventfon is therefore to 
provkte rneans for coupling external data traffic to a mul- 
tiprocessor conrputer system without affoctnrig the avaO- 
ability of pins befonging to chips of the second level 
cache. 

Another object of the inventfon is to provkie means 
for coupling external data trafffo to a multprocessor 
computer system that permit a tow latency access to 
said computer systent It is iirportant that additfonal 
delays are avokied when data is transferred to and from 
saki corrputer system. 

Afurtfier object of the invention is to provkie means 
for coupling external data traffk; to a multprocessor 
conputer system that are able to handle hightend- 
wktth data Gnks. 

Such higfvbandwkith communKatfon links are 
required when connecting a number of multprocessor 
computer systems to a central coipGng facOity. A further 
objed of the inventfon is therefore to provkie means tor 
connecting h^fvbandwkith intersystem communfoatfon 
links to a coupBng fadfity and to give a concept for ^i- 
dent tnt^system comnrumcatfon. wfuch is a prerequi- 
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site for high performance parallel computing. 
SUMMARY OF THE INVENriKDN 

The underlying concept of the inventbn is to use s 
the processor bus for the transmission of I/O data (or, 
more generally, any kind of external data) to the second 
lerel cacha The bandwidth of the processor bus is so 
large that only atxxit 20% of this bandwidth are actually 
used for data transmission between processor arxJ sec- io 
end level cacha The t>arKiwidth required for the trans- 
mission of VO data is ntuch smaller than the barxiwidth 
necessary for data exchange between second level 
cache and processor. In a typical mult9>rocessor com- 
puter system, tt amounts to about 1% of the bandwidth IS 
of the processor bus. Therefore, it is possible, by means 
of an adequately designed bus protocol, to transmit 
both the data traffic between processor and second 
lerel cache and the I/O traffic via the same processor 
bus. Because of said high bandwidth of the processor 20 
bus, the additional transmission of UO data does not 
ot>struct the regular data traffic between processor and 
second level cache. 

One advantage of coipling I/O traffic to a rnult^Hoc- 
essor computer system via the processor bus is that no 2s 
extra pins of the second level cache have to be used 
anymore As pin count restrictions are espedaOy critical 
for the second level cache chips, the solution provided 
t)y this invention contributes to pin economy. 

The pins saved in this way can be used for other 30 
tasks, for exanrple for connecting more processors to 
the second level cache, or for increasing the bandwidth 
of the existing Bnks t>etween the processors arvi the 
second level cacha In any case, these eodra pins permit 
a performance increase of the multiprocessor computer ss 
system, and th^ alfow for more ftexfoarty when design- 
ing a new computer system because pin count restric- 
tions doni impose that severe limits any more. 

There are several ways how the invention can be 
Implemented. One can connect an I/O adapter io a 40 
processor in order to transfer I/O data via the processor 
bus to the second level cach& The advant^e of con- 
necting I/O adapters directly to the processor is that no 
extra cNps are required, which would also slow down 
data transfer. Usually, tfre avaOability of a processor 4s 
ch^*s pins is not a problem, because a processor does 
not have to host tfiat many data Isnks. The processor 
determines when to use the processor txis for the trans- 
mission of regular data traffic between second level 
cache and processor, and when to transmit I/O data. so 

A second way tfie invention can be used is to intro- 
duce switching means to the processor bus. A muttplex- 
ing tecftnique allows to altematively transmit processor- 
cache-traffic or I/O traffic via the processor bus. This 
solution might be favorable when, for some reason, tf)e 55 
processor's pins are not to be accessed . 

Altematively, wfien tfie processor bus is realized in 
tristate fogic comprising a Ngh irrpectence state, the I/O 



adapter can be directty connected to the processor txjs. 
Either tfie processor or the I/O adapter can switch to the 
inactive high impedance state in order not to interfere 
with a data transmission. An advantage of this solution 
is that rK> extra ch^ are required. 

The rate at which I/O traffic can be transmitted via a 
processor bus might differ from the rate at which I/O 
traffic from the external device arrives at the I/O adapt- 
ers. Therefore, it mi^ be advantageous to introduce a 
first irvfirst out buffer, a socalled speed matching butter, 
between I/O adapter and processor bus which can store 
incoming I/O data and which transmits said data at a dff- 
ferent rate to the processor bus, thus matching trans- 
mission speed. 

Further advantages arise with regard to a second 
level cache realized as a shared or clustered shared 
cache structure. In a shared cache structure, the differ- 
ent cache cftips have to t>e connected with data links, 
and these links use pins of the second level cache 
chips. In order to accommodate aD these links, connect 
a fot of processors to the second level cache, and con- 
nect the memory busses to the cache, an implemente- 
tion of the invention provides the necessary pins for 
these tasks: Shared cache concepts can be realized 
wh3e there remain mary pins that can be used for con- 
necting processors to the cache Thus, improved com- 
puter design becomes possible. 

Especially with regard to a coipling facility, an 
implementation of the invention provides advantages. 
While I/O traffic in a regular multiprocessor computer 
system only amounts to about 1% of a typical processor 
bus* bandwidth, data traffic between a coupling facility 
and the external multiprocessor computer systems 
attached to that coupling fadlity occupies up to 10% of 
said processor bus' bandwidth. On the other hand, in a 
coupling fadlity, the data trafffo between cache and 
processor only anxMjnts to about haff of that of a regular 
multiprocessor system. 

According to the inventfon. this co mp aratively big 
data traffc, whx;h wouki tie (fifficult to couple to the cou- 
pling facility's second level cache, can be fed to the cou- 
pling facility via the processor busses. Data transfer 
between said coupling fadlity and an external multiproc- 
essor system is much faster this way than when 
attempting to feed the external data trafffo to the cou- 
pling fadlity via a switch. Therefore, the invention pro- 
vides an effident method for the interconnection of 
multiprocessor computer systenrs to a big duster and 
thus contributes a key feature to the further devetop- 
ment of parallel computing. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The above set forth and other features of the inven- 
tion are made mae apparent in tfte ensuing D^aOed 
Description of the Invention when read in corijunction 
with the attached Drawings, wherein: 
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Figure 1 A is a block cGagramm that shows a mufti- 
processor computer system with dedicated caches, 
including switching means; 

Figure 1B is a block diagram that shows how 
switching means can be introduced to a multiproc- 
essor computer system with a clustered shared 
cache; 

Figure 2A is a block diagram of a multiprocessor 
system, the cache structure t>eing a fully shared 
cache, with the I/O adapters attached to the proces- 
sors; 

Figure 2B is a block diagram of a multiprocessor 
system with a clustered shared cache, the UO 
adapters being connected to the processors; 

Figure 2C is a bkx;k diagram that shows how one 
lyo adapter can be attached to several processor; 

Figure 3A is a bk)ck diagram that shows how I/O 
traffic can be connected to the processor bus via 
switching means; 

Figure 3B is a bkxk dagram that shows how a 
speed matching buffer can be introduced to the 
setup of Figure 3A; 

Figure 4 is a scheme depicting the stmcture of 
commands on the processor bus; 

Figure 5 is a block diagram that shows the bask; 
setup of a computer system compr^'ng several 
multiprocessor computer systems and a central 
coupling facility; and 

Figure 6 is a block diagramm that shows a multi- 
processor computer system that is to be used as a 
coupling faciBty, with the I/O adapters attached to 
the processors. 

DETAILED DESCRIPTION OF THE INVEmX)N 

In Figure 1A and 1B, solutions of the pria art are 
shown which overcome the prot>l em of pincount restric- 
tions by ttie introduction of switching means. In Rgure 
1 A, a miit9)rocessor computer system with incfivklually 
assigned second level caches is shown. The proces- 
sors (100) are connected via processor busses (101) to 
the second level cache chips (102). and there exist data 
links t>etween said second level cacfie ch936 and switch- 
ing means (103). TTie main memory (105) is also con- 
nected via a memory bus (104) to said switching 
means, and so are the I/O adapters (107). The switch 
solves the problem of accomodating both the memory 
bus (104) and the data Enks (106) for I/O traffk; at the 
second level cach& If a processor ^ores data to or 



fetches data from either the memory or an I/O device, 
the read or write access is performed via said switching 
means. 

A further example of a prior art solutk>n is given in 
5 Rgure 1 B. There, a clustered shared cache structure is 
realized, whnh means that all the processors of one 
cluster share a common cacfie assigned to saki cluster. 
Each of the processors (110) of one duster has to be 
connected via processor busses to each of the duster's 
10 second level cache ch^ (1 1 1 ). in order to have access 
to all the data contained in sakJcacha It is obvious from 
Rgure IB that the second level cache cNps already 
have to host a kit of data links, and therefore, memory 
and I/O devk^es have to be cotpled to each duster via 
15 amtching means. Each memory card (114) ist con- 
nected via a memory bus (113) to one of the switches 
(1 1 2). and saki switch also accomodates links to the I/O 
adapters (116). Each switch (112) has data Snks (115) 
to at least one second level cache dip of each duster. 
20 in order to permit access Of sakJ processor duster to all 
memory cards and I/O devrcee. 

According to the inventfon. instead of utilizing 
switching means, such as in Figure 1A and IB. lyO traf- 
fic is fed to the second level cache via processor bus 
25 means. Figure 2A shows an example of a multiproces- 
sor computer system with a shared cache structure, 
where the invention has been irrplemented. 

Each of the processors (200) is conneded via a 
processor bus to each of the second level cache dtvps 
30 (201). A second level cache chip contains memory 
banks (203) whk:h stare the cache lines contained in 
said cache, a directary that keeps track of the focation of 
the different cache lines, and a processor bus switch 
(202) that is responsible for connecting the conred 
35 memory bank to the processor bus. Each second level 
cache chip (201) is conneded via a memory bus ^) 
to a rnernory card (205). cornprising a storage controller 
(206) that controls access to the different DRAM banks, 
merrxwy address drivers, a key stor^e and memory 
40 data multiplexing means. In order to couple I/O traffic 
via the processor and the processor bus to the compu- 
ter system, I/O adapters (207) are attached to Bams of 
the processors (200). Though each processor has to 
accomodato processor busses linking sakJ processor to 
45 each of the second level cache chips, tfiere are usually 
enough processor pins available to host sakI I/O adapt- 
ers^ External devk;es can be coupled to the I/O adapter 
via busses, with the I/O adapters also being respons&)le 
for data format conversion of the incoming data. 
50 Figure 2B shows another embodiment of the inven- 
tfon. Here, a dustered shared cache is realized whuh 
consists of several second level cache chips (211). 
Each of these cadie chips is cormected via a processor 
bus to each of the processors (210) of its duster. Mem- 
55 ory busses (212) provide a Unk between sakJ second 
level cache chips and a common memory (213). In 
order to allow for the transmission of t/0 trafffo via the 
processor bus. I/O adapters (214) are attached to some 
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of the processors (210). Again, the use of the inventian 
permits to omit the switching means that were neces- 
sary In prior art solutions. 

In Figure 20. an 1/0 adapter (221) is attached to 
several of the processors (222). 1/0 traffic from the s 
perpheric devices is transmitted via several txsses 
(220) to the 1/0 adapter (221). The 1/0 traffic is then 
transmitted via several processor txjsses (223) to the 
second level cache. Thus, the amount of I/O traffic 
transmitted via each of the attached processor busses io 
can t)e dynamically acjusted according to tfie amount of 
regular data traffic on that bus. A more efficient use of 
the processor bus ressources and a dynamic load bal- 
ancing is achieved. 

Instead of connecting ihe 1/0 adapters to the proc- 75 
essors, in order to use the processor busses tor trans- 
mission of I/O data to the second level cache, one can 
ateo introduce multiplexing means to the processor bu& 
Figure 3 shows such a solution. A processor (300) is 
connected via processor busses (301) and via said nfuil- 20 
tiplexing means (302) to several second level cache 
chips (303). A data 6nk (304) connects said multiplexing 
means (302) to an I/O adapter 005). wHh busses (306) 
connecting peripheric devices to said 1/0 adapter. Said 
multiplexing means further corrprise artHtration logpc 2s 
which permits access to the processor bus for either the 
processor (300) or the I/O adapter (305). The transmis- 
sion of both 1/0 traffic and regular processorK^che traf- 
fic via a processor bus used in nxMpkex operation 
permits to save valuable pins of tf)e second level cache 30 
chips. 

The rate at which incoming I/O data arrives at tfie 
1/0 adapter, and the rate at which I/O data transmission 
via the processor bus is possible; may cfiffer. Figure 3B 
suggests a solution to ttvs problem. A speed matching 3s 
buffer (307) is included in the date Bnk (304) between 
I/O adapter (305) and rrultiplexer (302). Said speed 
matching buffer (307) provides storage space where 
incoming data can be buffered bdbre it is transmitted 
via the processor bus. 40 

In Rgure 4, tfie structure of instructions and data 
transmitted via the processor bus is sfiown in order to 
demonstrate how regular processor-cache traffic and 
I/O traffic can be interleaved. The activity on tfie proces- 
sor bus is shown as a function of time, with one segment 45 
(400) representing one cyde of the processor. 

In case a processor wants to perform a read access 
to the memory, a "PU Fetch' instruction (402) is issued. 
In a first cyde, the command itself and the length of tfie 
data line to be fetched are transnritted, and in a second so 
cyde, tfie start address of the read operation is given. 

TU Fetch Dab' (404) is the response to said 'PU 
Fetch' (402). In its first cyde, the response field specify- 
ing the type of the response, its target and status is 
transmitted together with an iderrtffication of the proces- ss 
sor that has issued \he conresponcfing 'PU Fetch'. The 
data requested by this processor is then transnrvtted in 
the following cydes. 



When a processor has to perform a write operation 
to the memory, a "PU Store' command (403) is issued, 
with the commarxJ and the length of the data line trarts- 
mitted in a first cyde, with the memory address where 
the data has to be written to specified in a second cyde, 
and with the data itself transmitted in the following 
cyde& 

In case a processor wants to access I/O data, the 
I/O data has to be transferred to the menxMy, and then 
the processor has to access tfie memory by issuing 'PU 
Fetch- and TU Store'. Data transfer between the I/O 
device and main memory is done with the commands 
'I/O Fetch- (401) and "I/O Store' (405). 

'I/O Fetch' (401) is transmitted from the 1/0 device 
via the I/O adapter to the processor bus, in order to fetch 
data from the menrxxy. While the command and the 
length of ttie data line are transmitted in the first cyde, 
the start address of tfie read access is given in the sec- 
ond cyde. 

'I/O Fetch Data' (406) is the response correspond- 
ing to said 'I/O Fetch' command (401). In its first cyde, 
the response field specifying the type of the response, 
its target and status is transmitted together with an iden- 
tification of the I/O adapter that has issued the 'I/O 
Fetch'. The data requested by this I/O adapter is then 
transmitted from the memory to the 1/0 device in the fol- 
lowing cydesw 

An '1/0 Store' command (405) is irvtiated by an I/O 
device in order to write I/O data to the menfx>ry. Again, 
the command specification inducGng the length of the 
data line to follow is transmitted in a first cyde. the mem- 
ory address that specffies where the data has to be writ- 
ten to is given in a second cyde, and the following 
cydes contain the data itself. 

Obviously, there is a lot of similarity between the 
commands that control data exchange between proces- 
sor and main memory, and those that are responsible 
for data exchange between I/O devices and main mem- 
ory. One group of commands does not obstruct the 
other ^oup of commands. Therefore, no problems arise 
when commands of both groups are transmitted via the 
processor bu& 

Especially when several rrujttfxocessor computer 
systems are coined in order to constitute one big sys- 
tem, an implementation of the invention is advanta- 
geous. Figure 5 shows such a system consisting of 
several connected multiprocessor systems that form a 
duster. A central nult^)rocessor system, tfie coi^ng 
tadlity (500). is responsble for the management of data 
flow between the attached multiprocessor computer 
systems (503), for the management of its rattier large 
central memory (505) which each of tfie odemal sys- 
tems can access, and especially for the management of 
the external systems* access rights to tfie data struc- 
tures stored in said central memory. WhOe sfiared data 
structures are contained in the coupling facdit/s central 
menfX)ry (505) , each nultiprocessor system stOI owns its 
own menwry wtiich rt can access. Each of the external 
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multiprocessor systems possesses several I/O adapters 
(504) In order to haixlle data traffic wrtfi I/O devices. The 
t/0 adapters (501) of the coupling facility fulfill a drffererrt 
taste all the data traffic between different multiprocessor 
systems connected via data links (502) to the coupling 5 
facility, arxi all the data traffic t>etween an external mul- 
tipnx;essor syst^ (503) and the co^pGng facility's cen- 
tral memory (505). has to take place via said 1^ 
adapters (501). This is tfie reason ¥vhy tf>e amount of 
external data traffk; a coupling fadlity has to handle w 
exceeds by far the amount of IAD traffic arising in a typi- 
cal multiprocessor computer system. The question is 
hew to couple tftis big amount of external data to the 
coupling facOity, which is basically a regular multproc- 
essor system. In prior art solutions, switches were used 75 
for this purpose, with all the problems that are implied 
with the use of switches. For future couplir)g facilities, 
because of the requirement of a high transmissk>n rate, 
these solutions might rx>t even be possible any more. 

Figure 6 shows hew the mMprocessor computer 20 
system of Rgire 2A can be nrxxiified in order to buiM a 
highi>erfbrmarv;e coupling facOity. In order to do so^ I/O 
adapters (601) are not only attached to some, but to 
each of the processors (600), thus providing the band- 
width necessary for accomodating the required inter- 2s 
system comrrunication links. Because external traffic is, 
according to the invention, transmitted to the second 
level cache via the processor bus, no extra pins of the 
secoTKl level cache are used. Therefore, it is possible to 
realize a shared second level cache concept and thus 30 
connbtne the advantages of a shared cache with the 
availability of eff icient intersystem commuiication links. 
An irrplementation of the invention thus permits con- 
cepts for future high-performance coup&ng facilities 
which would not be possible with switching meansw 35 

Claims 

1. A corrputer system, comprising 

40 

processing means (200); 

cache storage means (201); 

processor bus means for transmitting infomna- 45 
tion between said processing means (200) and 
said cache storage means (201); and 

control means (207) for controlling input/butput 
(1/0) traffic between said corrputer system and so 
external devfoes; 
cfiaracterized in tfiat 

said corrputer system further conrprises trans- 
niissionrneans for transmrttsriglAO traffic to and ss 
from said cache storage means (201) via said 
processor bus means. 



2. A corrputer system according to daim 1, 
characterized in that 

said control noeans (207) are connected to said 
processing means (200). with \J0 traffic being 
transmitted to and from said processor txs 
means via said processing means (200). 

3. A conrputer system acoonfing to claim 1. 
characterized in that 

said control means (305) are connected to said 
processor bus means (301) by switching 
means (302), in a way that either sakj process- 
ing means (300) or said control means (305) 
are connected to said processor bus means 
(301) via ssad switching means (302). 

4. A computer system according to daim 1, 
characterized in that 

said processor bus means are realized in tri- 
state fogk; corrprising a high irrpedance state, 
and that sakJ control means are connected to 
said processor bus means. 

5. Acorrputersystemaccordingtoany of claims2to 
4, 

characterized in that 

said control means (305) further comprise a 
first-in^irst-out buffer (304) in order to match 
the transmission speed of I/O data transfer 
between said control means (305) and said 
processor bus means (301). 

6. A conputer system according to any of daimsl to 
5, 

characterized in that 

said processing means (200) comprise several 
processors, and that said cache storage 
means (201) are irrplemented as shared cache 
storage means, with sakj processors sharing 
saki shared cache storage means in a way that 
data units contained in sakJ shared cache stor- 
age means are accessible to each of sakJ proc- 
essors. 

7. A conputer systm according to any of claims 1 to 
5, 

characterized in tfiat 

sakJ processing means (210) conrprise several 
processors arranged in processor clusters, and 
that sakj cache storage means (21 1 ) comprise 
several cache storages arranged in cache stor- 
age dusters, with the processors of a proces- 
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SOT duster sharing the cache storages of a 
cache storage duster in a way that data units 
contained In said cache storage duster are 
accesslt)le to each of the processors of said 
processor cluster. 5 

8. A computer system complex conprising at least 
two computer systems (503, 500), with one of said 
computer systems iDeing a central computer system 
(500), and with each of said computer systems io 
(503, 500) comprising 

processing means (600); 



comprising 

processing means (200); 

cache storage means (201); 

processor txjs means for transmitting informa- 
tion between said processing means (200) and 
said cache storage means (201); and 

control means (207) for controlling I/O traffic 
between said computer system and external 
devices. 



cache storage means (201); is 

processor bus means for transmitting informa- 
tion k>etween said processing means (600) arvj 
said cache storage means (201); 

20 

control means (601) for controlling intersystem 
information traffic to and from said computer 
system (503. 500); 
t>eing characterized in ttiat 

25 

said computer system complex further com- 
prises intersystem information links (502), with 
said intersystem infonnation finks (502) con- 
necting the control means (504) of each of sanl 
computer systems (503) with the control 30 
means (501) of said central computer system 
(500), with intersystem information traffic 
between said computer systems (503) and said 
central computer system (500) being transmit- 
ted via said intersystem information Imks (502); 35 
and 



11. A method of transmitting I/O traffic in a computer 
system, said computer system comprising 

processing means (200); 

cache storage means (201); 

processor bus means for transnvtting informa- 
tion between said processing means (200) arxl 
said cache storage means (201); and 

cnlrol means (207) for controlling lAO traffic 
between said computer system and external 
devices; 

characterized in that 

said I/O traffic is transmitted to and from said 
cache storage means (201) via said processor 
txis means. 



at least one of said computer systems (503, 
500) furtfver comprises transmission means for 
transnvtting said intersystem information traffic 40 
between the control means (601) and the 
cache storage means (201) of said computer 
system via tfie processor txis means of said 
corrputer system. 

45 

9. A computer system complex according to daim 8, 
characterized in that 



said computer system complex further conh 
prises central niemory means (505) attached so 
to said central computer system (500), with 
data units stored in said central memory 
means (505) being accessible to said other 
computer systems (503) via said intersystem 
information finks (502). ss 

10. Use of processor bus means in a computer system 
for transmitting I/O traffic, said computer system 
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